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<110> APPLICANT : National Research Council of Canada 
Zou, Jitao 
Taylor, David C 
Wei, Yangdou 

Jako, Colette C Transferase Gene from Plants 

<120> TITLE OF INVENTION: Diacylglycerol Acyl Transferase be 

<130> FILE REFERENCE: 43922 

<140> CURRENT APPLICATION NUMBER : US 09/623 ,514A ^ 

<141> CURRENT FILING DATE: 2000-10-03 f) _ J 



<150> PRIOR APPLICATION NUMBER: PCT/CA99/01202 
<151> PRIOR FILING DATE: 1999 - "-16 
<150> PRIOR APPLICATION NUMBER: US 60/112,812 
<151> PRIOR FILING DATE: 1998-12-17 
<160> NUMBER OF SEQ ID NOS : 2 5 
<170> SOFTWARE: Patentln Ver. 2.1 
<210> SEQ ID NO: 1 
<211> LENGTH: 1904 
<212> TYPE: DNA 

<213> ORGANISM: Arabidopsis thaliana 

<400> SEQUENCE: 1 a ff,aattct atttcctctt 60 

atttcttagc ttcttccttc aatccgctct ttccctctcc attagattct £tt ^ 

tcaatttctt ctgcatgctt ctcgattctc tctgacgcct °"tt ggtgaC ggag 180 

cgtcaaacgc ttttcgaaat Jjcgattttg jattctgctg ^gttacta^ gg g^g ^ 
aacggtggcg gagagttcgt cgatcttgat aggcttcgtc ^ W tgatgt tgga 300 

tcttctaacg gacttcttct ctctggttcc v ai - a „nna*aacac tcaqggaaca 360 

gctcccgccg acgttaggga tcggattgat tccgttgtta acgatgacgc tcaggg ^ 
gccaatttgg ccggagataa taacggtggt ^gataata acggtggtgg g gg g^ ^ 
ggagaaggaa gaggaaacgc egatgctjcg tttacgtatc gccgtw 54Q 
cggagggcga gagagagtcc acttagctcc ^cgca tcat catC gaaaat 600 

ttattcaacc tctgtgtagt agttcttatt ^tgtaaaca ^agac * tgcga 66 0 

cttatgaagt atggttggtt gatcagaacg gatttctggt "agttcaag 
gattggccgc ttttcatgtg ttgtatatcc ctttcgatct ttcctttggc tgcct g 
gttgagaaat tggtacttca gaaatacata cag a ct ttgtcatctt tct^^ ^ 
attatcacca tgacagaggt tttgtatcca guw a tataqctaaa gttggtttct 900 
tttttatcag gtgtcacttt gatgctcctc "ttgcattg tgtggctaaa g JJ^ g60 
tatgctcata ctagctatga cataagatcc ctagccaatg ^tg tcccacattg 1020 
gaagtctcct actacgttag cttgaagagc "ggcatatt tcatggtcgc tccc g ^ 
tgttatcagc caagttatcc acgttctgca tgtatacgga a 999"9*£ 99 £ 
t?tgcaaaac tggtcatatt caccggattc atgggattta taatagaaca £££ 
cctattgtca ggaactcaaa gcatcctttg aaaggcgatc ttctatatgc 
gtgttgaagc tttcagttcc aaatttatat £gtggctct ^atgttc^ c ^ 
cacctttggt taaacatatt ggcagagctt ctctgcttcg ggg 9 * acctqttcat 1380 
g at tg ,t„, .t*~»£ ££££ So —=tc 1440 

£2££ «XtS" £™ £™? s .t r « IIS 
SSSS S£S£ SSS A- i«. 
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56 atcttctgca ttttcggaca accgatgtgt gtgcttcttt attaccacga cctgatgaac 1680 

57 cgaaaaggat cgatgSatg aaacaactgt tcaaaaaatg actttcttca "catctatg 1740 

58 gcctcg^gg atctccgttg atgttgtggt ggttctgatg ctaaaacgac ^atagtgtt 1800 

59 ataaccat?" aagaagaaaa gaaaattaga gttgttgtat ctgcaaaaat tttggtagag I860 

60 acacgcaaac ccgtttggat tttgttatgg tgtaaagcgg ccgc 

63 <210> SEQ ID NO: 2 

64 <211> LENGTH: 520 

65 <212> TYPE: PRT 

66 <213> ORGANISM: Arabidopsis thaliana 

S Leu Lp ser ,1. 01, V,! Thr nr «1 T h r 8l u ». «l7 

5 10 
72 Gly Gly Glu Phe Val Asp Leu Asp Arg Leu Arg Arg Arg Lys Ser Arg 

75 Ser Asp Ser Ser Asn Gly Leu Leu Leu Ser Gly Ser Asp Asn Asn Ser 

40 45 



Pro Ser Asp Asp Val Gly Ala Pro Ala Asp Val Arg Asp Arg He Asp 

55 60 



76 35 
78 Pro Ser Asp 

11 Ser vTl Val Asn Asp Asp All Gin Gly Thr Ala Asn Leu Ala Gly Asp 

84 Asn Asn Gly Gly Gly Asp Asn Asn Gly Gly Gly Arg Gly Gly Gly Glu 

85 90 

85 Gly Arg Gly Asn Ala Asp Ala Thr Phe Thr Tyr Arg Pro Ser Val Pro 



105 HO 



87 

90 Ala His Arg Arg Ala Arg Glu Ser Vro Leu Ser Ser Asp Ala lie Phe 

91 115 120 



88 100 
90 Ala His Arg Arg 

93 Lys Gin III His Ala Gly Leu Vhl Asn Leu Cys Val Val Val Leu He 

135 140 

96 Ala III Asn Ser Arg Leu lie lie Glu Asn Leu Met Lys Tyr Gly Trp 

97 145 150 155 » m 

99 Leu He Arg Thr Asp Phe Trp Phe Ser Ser Arg Ser Leu Arg Asp Trp 

100 165 170 13 5 _ 

102 Pro Leu Phe Met Cys Cys He Ser Leu Ser lie Phe Pro 

103 180 ™* 190 

105 Phe Thr Val Glu Lys Leu 

106 195 

108 Val He Phe Leu His He 

109 210 

111 Val Tyr Val Thr Leu Arg 

112 225 230 



115 245 

117 His Thr Ser Tyr Asp 

118 260 

120 Asn Pro Glu Val Ser 

121 275 

12 3 Met Val Ala Pro Thr 
124 290 



He 
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185 
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200 
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He 
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220 
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Ser 
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Phe 


Leu 


Ser 








235 






He 


Val 


Trp 


Leu 
250 


Lys 


Leu 


Val 


Arg 


Ser 


Leu 
265 


Ala 


Asn 


Ala 


Ala 


Tyr 


Val 


Ser 


Leu 


Lys 


Ser 


Leu 


280 










285 


Cys 


Tyr 


Gin 


Pro 


Ser 


Tyr 


Pro 


295 










300 
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SAW SEQUENCE LISTING ™TE: ^l' 2 ™ 1 

PATENT APPLICATION : US/09/623 , 514A TIME: 14:00:18 

Input Set : A:\43922new.txt 

Output Set: N:\CRF3\08032001\l623514A.raw 

310 315 320 



III Phe Thr Gly Phe Met Gly Phe lie lie Glu Gin Tyr He Asn Pro lie 

330 ° 
Leu Lys Gly Asp Leu Leu 
345 

Val Pro Asn Leu Tyr Val 
360 365 



III Val Arg Asn Ser lJs His Pro Leu Lys GlJ Asp Leu Leu Tyr Ala lie 
133 340 345 



135 Glu Arg Val Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys 

He 
380 



138 Met Phe Tyr Cys Phe Phe His ieu Trp Leu Asn lie Leu Ala Glu Leu 

375 JoU 
ifl Leu Cys Phe Gly As P Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys 



III sir Val Gly Asp Tyr Trp Arg Met Trp Asn Met Pro Val His Lys Trp 

1/|C . 405 410 

\\l Met Val Arg His lie Tyr Phe Pro Cys Leu Arg Ser Lys lie Pro Lys 
150 Thr Leu Ala lie lie He Ala Phe Leu Val Ser Ala Val Phe His Glu 
153 Leu Cys lie Ala Val Pro Cys Arg Leu Phe Lys Leu Trp Ala Phe Leu 



455 



1 5 56 Gly lie Met Phe Gin Val Pro Leu Val Phe lie Thr Asn Tyr Leu Gin 
III Glu Arg Phe Gly Ser Thr Val Gly Asn Met He Phe Tr P Phe He Phe 



III Cys He Phe Gly Gin Pro Met Cys Val Leu Leu Tyr Tyr His Asp Leu 
163 500 505 

165 Met Asn Arg Lys Gly Ser Met Ser 

166 515 520 

170 <210> SEQ ID NO: 3 

171 <211> LENGTH: 5193 

172 <212> TYPE: DNA 

173 <213> ORGANISM: Arabidopsis thaliana 

III £t™= E ca™c *ttcc.ttt, gtttt.ttt, tttcaa.gtt taat.ttcct 60 
177 Ltgtataac attcaaatot tcacatgatt gattgtgtga aa.ccccaca jattttaota 120 

-9 ssek » ~g sssss r t = 

180 aaaaaataJg tattgttaat cttaaaaatg taggagtaca catcaaatac tcgagcataa 00 

181 tcaaaaccg? attcatagac cgatgtgaga atcaaataga gataatgtg ^tttttaaa 60 

182 atatcgtatc tccaaatcaa tcacttagaa gataatgtaa ttctttatgt ^tacaraaa 
Tfi3 taaatatata tatatatata tatatatatc ttgtatatat gtcttgacaa aaaattgcca 480 

183 taaatatata wuwu atcaaattga atcaaactat aagtcggatg 540 
iS a«,«"« lllltl ?,t tt" 'tacalaccg, aa.at.gata tt.tag.tac 600 

• g.t«gtg« tattatt.g. agatttgg.a tttcatcatt =tc.gga-t 

187 a.agtacttc ccta.tt.aa tc-.tgtcggt tga..a.gc « "ftftt"^ 780 

■ iS VZZZZ t"a£S ™£ SStUJ t.tctttg.t = ..t SJO 
190 c.tttc»t.t tctattttga tgttu.ga. aacactattt a^agttaca """"" IZ 
III SSK£ "aSg.15 ESSE aSfatc". USES 10 2 0 
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RAW SEQUENCE LISTING DATE: 08/03/2001 

PATENT APPLICATION: US/09/623 , 514A TIME: 14:00.18 

Input Set : A:\43922new.txt 

Output Set: N:\CRF3\08032001\I623514A.raw 

193 tccaagttta taataaatac atttcaaaga ctattagttc ttcttaaaat atttctaaaa 1080 
ill gtgatcaaag actaccacat ataattcaga aaaagtagaa gttgatttct ttttgtcaaa 1140 
195 ?aaataattg acttaaaata gtttggaaag ccattgaact tgattataga attgataatg 1200 
ill tacataaaaa aattccaagt ttataataaa tacatttttc aaatgctata tcagttcttc 1260 

197 ttaaaatatt tcactaaaaa aacactcaaa tatagaataa atttattgaa taacatacca 1320 

198 actgtaaaac agaatttgac aaaaaaaaaa aaaaaatgaa atgaagatga agacaaaaat 1380 

199 aaatcaccag aggatcttat gcaaaaaaat atatgaatac acaataaacc atattgatat 1440 

200 ttttaaaata aaataaaaac agaaaaatat cccaacaccg cttttcaatt aaaaatcttc 1500 
01 cg^caccatt gttgtcatct tcctctctcg tgaatccttt ttcctttctt cttcttcttc 1560 

202 tcttcagaga aaactttgct tctctttcta taaggaacca gacacgaatc ccattcccac 1620 

203 cgatttctL gcttcttcct tcaatccgct ctttccctct ccattagatt ctgtttcctc 1680 

204 t?tcaatttc ttctgcatgc ttctcgattc tctctgacgc ctcttttctc ccgacgctgt 1740 
5 ttcgtcaaac gctt?tcgaa atggcgattt tggattctgc tggcgttact jcggtgacgg 1800 

206 agaacggtgg cggagagttc gtcgatcttg ataggcttcg tcgacggaaa tcgagatcgg 1860 

207 attcttctaa cggacttctt ctctctggtt ccgataataa ttctccttcg gatgatgttg 1920 
08 gagctcccgc cgacgttagg gatcggattg attccgttgt taacgatgac ^ctcagggaa 19 

209 cagccaattt ggccggagat aataacggtg gtggcgataa taacggtggt ggaagaggcg 2040 

210 gcggagaagg aagaggaaac gccgatgcta cgtttacgta tcgaccgtcg Sttccagctc 2100 

211 atcggagggc gagagagagt ccacttagct ccgacgcaat cttcaaacag Stttaaaatc 2160 

212 Tctllalllt Lgaatttgg tgtttgcttg ttgttttata tggaattgag tttggtgatt 2220 

213 gtt?tgcatt gcagagccat gccggattat tcaacctctg tgtagtagtt ettattgctg 2280 

214 taaacagtag actcatcatc gaaaatctta tgaaggtttg ctgttacttg "tctccttt 2340 

215 taggaattga attgcttgaa aatttatcag agacgaataa ctttgttgtt gctatcattc 2400 
111 atgLgtaJg gttggttgat cagaacggat ttctggttta gttcaagatc ^gcgagat 2460 

217 tggccgcttt tcatgtgttg gtaaaagaag atgtttttta tttccagcaa tgttacattg 2520 

218 ttatacgtat aatgatgagt ttagtgatca agttcctctt tgattcttct ttcttgttgc 2580 
2" a£ata?ccc tttcgaLtt tcctttggct gcctttacgg ttgagaaatt ^acttcag 

220 aaatacatat cagaacctgt gagtaattac tattctccag ccattactgt ""tttatt 2700 

221 gaagacaagt ttgtatcatg aagaacttac aagttctgtt ttgaaaatgc tcaaggttgt 2760 

222 catctttc?t catattatta tcaccatgac agaggttttg tatccagttt «*tcaccct 2820 

223 aaggtgatac tgtttttctg gtctcagttt gtgatactgt ttttaagttt ^tgtctga 2880 

224 cccggtgatc ttgaaaatgg acaggtgtga ttctgctttt ttatcaggtg tcactttgat 2940 

225 gc^cctcact tgcattgtgt ggctaaagtt ggtttcttat gctcatacta gctatgacat 

226 aagatcccta gccaatgcag ctgataaggt aaaatacgaa aaagaagcgt atgtattagt 3060 

227 cacttgcact gtgttactgt tttaaccaaa cactgttatg aactttaggc caatcctgaa 3120 

228 gtctcctact acgttagctt gaagagcttg gcatatttca tggtcgctcc -cattgtgt 10 

229 tatcaggtaa ctgcaaagtg catcaaccat tcttatactt gcaagagttt cttgtctaaa 3240 
23 cStcggatct ttgctttLc ccagccaagt tatccacgtt ctgcatgtat acggaagggt 300 

231 tgggtggctc gtcaatttgc aaaactggtc atattcaccg gattcatggg atttataata 3360 

232 gaacaagtac gttttcacat cttgctttat tagttttcct tggtgaaaat catcatccct 420. 

233 gcgttgtcac cacttgactt catgttcttt tgttacattt tggcagtata taaatcctat 3480 

234 tgtcaggaac tcaaagcatc ctttgaaagg cgatcttcta tatgctattg aaagagtgtt 3540 
• 235 gaagc^tca gttccaaatt tatatgtgtg gctctgcatg ttctactgct tcttccacct 600 

236 ttggtatgct gtgatcccat ctctttcaaa ataatttgca aattcgaaaa accgaaaaag 3660 

237 gc?aaatctc atacgaattt gatattttta gtttcttaga gtcggtgatg taatttcagt 20 
' 238 Lctgaacgc aaatctcttg tccaaaggtt aaacatattg gcagagcttc tctgcttcgg 3780 

239 ggatcgtgaa ttctacaaag attggtggaa tgcaaaaagt gtgggagatg Jgagctattt 3840 

240 Kctcaaaag aaaacttatg atttttaatg ttgtcgttgt ttttgggtca tctaactaac 3900 

241 caaattcatg tattcactgt cttcctttat cagtactgga gaatgtggaa tatggtatgg 3960 
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242 ttctcttcct aaacatcacc ttcttttgta cacaaaatag aagaagagag ctaattaaga 4020 

243 tcttgttttc cttgacagcc tgttcataaa tggatggttc gacatatata cttcccgtgc 4080 
24 ttgcgcagca agaLccaaa ggtgagtgag atatataccg atatgcaatt gtcgagattt 140 

245 gtttctgtga tataaattta accctccaca cacttgtttt tcagacactc gccattatca 4200 

246 SgcttLct agtctctgca gtctttcatg aggtatacat actttctaca ttgccctgtc 260 

247 tSagacgca tgaacacacg ctagtgaaag aaatgctaat attcaaagca ttgtttttac 4320 
28 ttaacgaLt tgtgttacaa atttcctttt gacagctatg catcgcagtt ccttgtcgtc 

249 tcttcaagct atgggctttt cttgggatta tgtttcaggt taaaaaatta Jtaaactgct 4440 

250 gcagtcgatt tttactaaac tctaatctca tattctgacc aaccaatttg tttgagtagg 450U 

251 ?gcctt?ggt cttcatcaca aactatctac aggaaaggtt tggctcaacg gtatgctctc 4560 

252 aaaacccgag aaaatagaac gaataactct ttctttcata gcctagccat ttaaatcgca 4620 
III atgctg^aac ttaataataa aggtgatctg ttttggaatg ^atcatatt attaggtggg 4680 
254 aaacataatc ttctggttca tcttctgcat tttcggacaa ccgatgtgtg tgcttcttta 474U 

55 Jtaccacgac ctgatgaacc gaaaaggatc gatgtcatga aacaactgtt -aaaaatga 800 

56 ctttcttcaa acatctatgg cctcgttgga tctccgttga tgttgtggtg ? ttctgatgc 8 0 

257 taaaacgaca aatagtgtta taaccattga agaagaaaag aaaattagag ttgttgtatc 49^0 

258 ^gcaaaaatt ttggLgaga cacgcgaacc cgtttggatt ttgttatggt ? taaagaaat 4980 
"259 tJeaatcaaa aaactgttgt aataattgtt accaaaaaga aatgcttttc tggaaacgag 5040 

260 gggaaaaata gtagttttgt taggttttac tgtttggacc aaatctagta jaaaactttt 5100 
2 61 tgtaataagg aaaaaaaaag aacaaatgtg ataaatgcat ggggattgta tgaaaccttc 5lb0 
262 caataaagtt gattggtggt cccgttttgg gga 

265 <210> SEQ ID NO: 4 

266 <211> LENGTH: 498 

267 <212> TYPE: PRT 

268 <213> ORGANISM: mouse 

va « «rsrsi civ «ir u. d» ~ - «« «« ° iy 

5 10 

274 Ser Arg Val Ser Val Gin Gly Gly Ser Gly Pro Lys Val Glu Glu Asp 
->7<i 20 25 

277 Glu Val Arg Asp Ala Ala Val Ser Pro Asp Leu Gly Ala Gly Gly Asp 



280 Ala Pro Ala Pro Ala Pro Ala Pro Ala His Thr Arg Asp Lys Asp Gly 

?R1 50 55 60 

283 Arg Thr Ser Val Gly Asp Gly Tyr Trp Asp Leu Arg Cys His Arg Leu 

"70 7 5 

286 Gin Asp Ser Leu Phe Ser Ser Asp Ser Gly Phe Ser Asn Tyr Arg Gly 

85 9° 95 

lie Leu Asn Trp Cys Val Val Met Leu lie Leu Ser Asn Ala Arg Leu 



287 
289 

290 100 



105 HO 



292 Phe Leu Glu Asn Leu lie Lys Tyr Gly lie Leu Val Asp Pro lie Gin 
993 115 120 125 

295 Val Val Ser Leu Phe Leu Lys Asp Pro Tyr Ser Trp Pro Ala Pro Cys 

135 140 
298 Val lie lie Ala Ser Asn He Phe Val Val Ala Ala Phe Gin lie Glu 



301 lyl Arg Leu Ala Val III Ala Leu Thr Glu Gin Met Gly Leu Leu Leu 

165 170 
III His Val val Asn Leu Ala Thr He He Cys Phe Pro Ala Ala Val Ala 
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L: 


12 M:283 W: 


Missing 


L: 


560 M:341 


W 


(46) " 


L 


562 M:341 


W 


(46) " 


L 


564 M:341 


w 


(46) " 


L 


784 M:341 


w 


: (46) " 



VERIFICATION SUMMARY DATE : 08/03/2001 

PATENT APPLICATION: US/09/623 , 514A TIME: 14:00:19 

Input Set : A:\43922new.txt 
Output Set: N:\CRF3\08032001\I623514A.raw 

Missing Blank Line separator, <140> field identifier 



■n" or "Xaa" used, for SEQ ID#:6 
'n" or "Xaa" used, for SEQ ID#:10 
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